Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 1022420110030020065
Phonetics and Speech Sciences
2011 Volume.3 No. 2 p.65 ~ p.70
Improvement of Rejection Performance using the Lip Image and the PSO-NCM Optimization in Noisy Environment
Kim Byoung-Don

Choi Seung-Ho
Abstract
Recently, audio-visual speech recognition (AVSR) has been studied to cope with noise problems in speech recognition. In this paper we propose a novel method of deciding weighting factors for audio-visual information fusion. We adopt the particle swarm optimization (PSO) to weighting factor determination. The AVSR experiments show that PSO-based normalized confidence measures (NCM) improve the rejection performance of mis-recognized words by 33%.
KEYWORD
audio-visual speech recognition, particle swarm optimization, normalized confidence measure, rejection performance
FullTexts / Linksout information
Listed journal information
ÇмúÁøÈïÀç´Ü(KCI)